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1 

HUMAN IMMUNODEFICIENCY VIRUS CO-RECEPTOR VARIANTS 
ASSOCIATED WITH RESISTANCE TO VIRUS INFECTION 

The present invention relates to variants of 
5 the CCR5 human immunodeficiency virus type 1 (HIV-1) co- 
receptor which results in a resistance of CCR5-expressing 
cells to HIV-1 infection* The detection of such mutations 
may be used to identify individuals at lower risk for 
infection relative to the general population who, if 

10 infected, may exhibit slower progression to AIDS. 
Further, the present invention provides for oligonucleo- 
tide primers and probes and diagnostic methods for 
detecting the presence of such variants and for methods 
of inhibiting HIV-1 infection of a cell expressing the 

15 CCR5 receptor, comprising introducing, into the cell, a 
nucleic acid encoding a CCR5 variant. 

Although human immunodeficiency virus type 1 
(HIV-1) uses the T cell surface molecule CD4 as a primary 
receptor, successful viral entry into, and infection of a 

20 cell has been found to require the presence of a second 
molecule, or co-receptor (Clapham et al., 1997, Nature, 
388, 230-231) . Seven co-receptor molecules have been 
identified, each of which members of, or related to the 
family of chemokine receptors, which are G-protein 

25 coupled receptors having seven transmembrane domains. 

Chemolcines are protein having molecular 
weights from about 7-16 kDa which, acting as ligands at 
chemokine receptors, induce a rapid calcium influx and 
mediate a number of effects on the immune system (Murphy, 

30 Cytokine Growth Factor Rev., 1996, 7, 47-64). Examples of 
chemokines include macrophage inflammatory protein (MIP)- 
la and MlP-lb, a protein which is regulated on activation 
normally T cell expressed and secreted (RANTES) , monocyte 
chemoattractant protein (MCP)-l, MCP-2, MCP-3, MCP-4, 
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eotaxin and stromal-derived factor (SDF)-l (Clapham et 
al., 1997, cited). Chemokines are proteins that are 
classified into two groups based on the presence of a 
non-cysteine amino acid (X) between the first two CC of 
5 four cysteine residues appearing in their amino acid 
sequence, giving rise to the CXC (a) family and the CC 
(b) family. Receptors which specifically recognize CXC or 
CC chemokines are referred to, accordingly, as CXCR or 
CCR (Dynamics of HIV infection, Science and Medicine, 

10 1998, 36-45) . 

Two species of chemokine receptors which 
appear to be particularly relevant to HIV infection are 
CCR5 and CXCR4 , for which the natural ligands are MlP-la, 
MlP-lb and RANTES (CCR5) and SDF-1 (CXCR4). To date, most 

15 of HIV-1 clinical isolates studied appear to use CCR5 or 
CXCR4, or both, as co-receptors with CD4 for entry into 
cells, and the presence of chemokine ligand inhibits 
infection via the corresponding receptor. 

The cellular distributions of CCR5 and CXCR4 

20 are associated with the role of these molecules in the 
course of HIV-1 infection. CCR5, which is mainly 
expressed on macrophages and memory T cells, serves as, a 
co-receptor for infection by macrophage-tropic (M-tropic) 
strains of HIV-1, which are found throughout the course 

25 of infection, are preferentially involved in sexual 
transmission, of HIV-1 and are expressed by non-syncitium- 
inducing laboratory isolates which do not cause cell/cell 
fusion in T cell lines (Cocchi et al., Science, 1995, 
270, 1811-1815; Alkhatib et al., Science, 1996, 272, 

30 1955-1958; Choe et al., Cell, 1996,, 85, 1135-1148; Deng 
et al, Nature, 1996, 381, 661-666; Doranz et al, Cell, 
1996, 85, 1149-1158; Dragic et al., Nature, 1996, 381, 
667-673) . CXCR4, however, which is expressed on a broader 
spectrum of cells, including naive T cells, serves as the 
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co-receptor in late stages of infection for syncitium- 
inducing T cell tropic (T-tropic) strains of HIV-1 (Belul 
et -al., Nature, 1996, 382, 829-833; ~ Oberlin et al., 
Nature, 1996, 382, 833-835; Feng et al., Science, 1996, 
5 272, 872-877). Accordingly, the co-receptor which is more 
relevant to the initiation of HIV-1 infection appears to 
be CCR5 . 

Indeed, an association* has been drawn between 
rare individuals who remain persistently uninfected 

10 despite multiple sexual exposures to HIV and the presence 
of a 32 base pair deletion in the CCR5 gene (CCR5A32 : 
Samson et al., Nature, 1996, 382, 722; Liu et al., Cell, 
1996, 367) which results in a protein having a truncation 
of C-terminal residues 188-352 (including the fifth, 

15 sixth and seventh transmembrane domains) . Individuals 
heterozygous for this deletion, are however, susceptible, 
to infection (Dean et al.', Science, 1996, 273, 1856), 
although progression to AIDS may be slowed (Dean et al . , 
1996, cited; Samson et al., 1996, cited, Huang et al.,. 

20 Nature Med., 1996, 2, 1240-1243, Michael et al ., Nature 
Med., 1997, 3, 338-340). It has been proposed (Benkirane 
et al., J. Biol. Chem., 1997, 272, 30603-30606) that, 
coexpression of the CCR5A32 gene with wild type CCR5 gene 
results in trans-inhibition of the ability of CCR5 to. act 

25 as an HIV co-receptor, in which the CCR5A32 protein 
interferes with dimerization of CCR5 at the cell surface.. 
It has not, however, been confirmed that the dimerization 
of CCR5 occurs or is necessary for viral entry. 

The present invention relates to truncated 

30 variants of CCR5 which lack the portion of the molecule 
beginning with the third transmembrane domain. It is 
based, at least in part, on the discovery of a specific 
variant form of CCR5, termed CCR5m303, in which the gene 
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is mutated and creates a stop codon which arrests 
translation before the third transmembrane domain. 

Unexpectedly, the presence of both CCR5m303 
and CCR5A32 variant alleles in individuals was observed 
5 to confer resistance to infection by M-tropic strains of 
HIV-L. 

The present invention is also based on the 
fact that the CCR5m303 variant is unexpectedly more 
effective than CCR5A32 in trans-inhibiting the ability of 
10 wild type CCR5 to act as a co-receptor for HIV. . 
Therefore, individuals having a genotype which includes a 
wild type CCR5 allele and a CCR5m303 variant allele may 
be protected against infection by M-tropic strains of 
HIV. 

15 Accordingly, in a first series of embodiments, 

the present invention provides for compositions 
comprising a nucleic acid encoding the CCR5m303 variant 
and portions thereof which contain or which may be used 
to detect the m303 mutation. 

20 In a second series of embodiments, the present 

invention provides for methods of identifying the 
presence of the CCR5m303 variant in an individual,- 
wherein such methods may also include the identification 
of a second species of the CCR5 variant. The presence of 

25 the CCR5m303 variant, in conjunction with wild type CCR5 
or a second species of CCR5 variant, bears a positive 
correlation with resistance to infection with M-tropic 
strains of HIV-1 and may be indicative of slower pro- 
gression of disease in heterozygous individuals. 

30 In a third set of embodiments, the present 

invention provides for compositions comprising a nucleic 
acid encoding a CCR5 variant which comprises the first 
two transmembrane domains found in wild type CCR5 but 
lacks the remainder of the C-terminal end of the mole- 
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cule, and for the corresponding CCR5 variant proteins. 

In a fourth set of embodiments, the present 
invention provides for methods of inhibiting CCR5- 
mediated HIV infection of a cell comprising decreasing 
5 the number of functional CCR5 molecules present at the 
surface of the cell. Such methods include, but are not 
limited to, introducing, into the cell, a nucleic acid 
encoding a CCR5 variant which comprises the first two 
transmembrane domains found in wild type CCR5 but lacks 
10 the remainder of the C-terminal end of the molecule. 

Figure 1A-D. Infection of peripheral blood 
mononuclear cells (PBMC) from two unexposed uninfected 
(UU) and two exposed uninfected (ExU) individuals with 
CCR5-dependent (YU2) and CXCR4 -dependent (NL4-3) HIV 

6 

15 molecular clones after inoculation with 20 ng p24/10 
cells, with or without addition of RANTES (R) . Results 
are expressed as the amount of p24 antigen in culture 
supernatants, and are representative of four independent 
experiments. CCR5 genotype for the A32 deletion is indi- 

20 cated in parenthesis (wt/wt) = homozygous wild type; 
(wt/A)= heterozygous A32; (A/A)= homozygous A32; and 
wt?/A)= heterozygous A32 with an apparent wild type 
allele. 

Figure 2. Functional analysis of both CCR5 
25 alleles cloned from ExU2 using an £nv-mediated cell 
fusion assay. 

Figure 3A-D. Genetic analysis of CCR5 from two 
UU individuals, two ExU individuals, and two siblings of 
ExU2 . (A) DNA nucleotide and amino acid sequences of wild 
30 type (wt) and m303 alleles in a region spanning the 303 
mutation (SEQ ID NO: 11 and SEQ ID NO:12). (B) PCR ampli- 
fication of genomic DNA using specific primers for the 
A32 deletion (C) PCR amplification of the entire CCR5 
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gene from genomic (lanes 1-6) ■ and plasmid (lanes 7-9) 
DNA, followed by (D) Hindi digestion after amplifica- 
tion. A lkb DNA ladder was used as a marker (M) . Genomic 
DNA following A32 PCR analysis is designated as follows : 
5 ExU2 : Exposed Uninfected (m303/A32) ; 2.S : sister of 
ExU2 (wt?/A) 2.F : father of ExU2 (wt?/A) ; UU1 : 
Unexposed Uninfected (wt/wt) ; ExUl : Exposed Uninfected 
(A32/A32) ; UU2 : Unexposed Uninfected (wt/A) ; Plasmid 
DNA corresponds to wild type (wt) , A32, and m303 cloned 

10 alleles, wt? : apparent wild type allele. 

Figure 4A-B. Infection of PBMC from family 
members of ExU2 with CCR5-dependent (YU2) and CXCR4- 
dependent (NL4-3) viruses. Results are expressed as the 
amount of p24 antigen in cell-free culture supernatants . 

15 Day 12 of infection of PBMC from ExU2, 2.S, 2.F and UU2 

with 0.5 ng p24/10 6 cells of (A) YU2 or (B) NL4-3 HIV 

molecular clones. 

Figure 5A^-B. Cotransf ection experiments 
The present invention relates to variants of 
20 CCR5 which comprise the first two transmembrane domains 

but lack the remainder of the C-terminal portion of the 

molecule. Such variants lack transmembrane domains 3-7 as 

found in wild type CCR5. 

These transmembrane domains are positioned 
25 hereafter in reference to sequence SEQ ID NO: 12, which 

corresponds to the sequence of the CCR5 human gene (wild 

type), in which XXX means TGT and YYY means cysteine: 

- transmembrane domain 1 (TM1) corresponds to 
the nucleic fragment 333-407 of SEQ ID NO: 12 and to the 

30 protein fragment 32-56 of SEQ ID NO: 13; 

- transmembrane domain 2 (TM2) corresponds to 
the nucleic fragment 438-500 of SEQ ID NO: 12 and to the 
protein fragment 67-87 of SEQ ID NO: 13; 
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- transmembrane domain 3 (TM3) corresponds to 
the nucleic fragment 546-611 of SEQ ID NO: 12 and to the 
protein fragment 103-124 of SEQ ID NO: 13; 

- transmembrane domain 4 (TM4 ) corresponds to 
5 the nucleic fragment 663-740 of SEQ ID NO: 12 and to the 

protein fragment 142-167 of SEQ ID NO: 13; 

- transmembrane domain 5 (TM5) corresponds to 
the nucleic fragment 837-908 of SEQ ID NO: 12 and to the 
protein fragment 200-223 of SEQ ID NO: 13; 

10 - transmembrane domain 6 (TM6) corresponds to 

the nucleic fragment 945-1019 of SEQ ID NO: 12 and to the 
protein fragment 236-260 of SEQ ID NO: 13; 

- transmembrane domain 7 (TM7) corresponds to 
the nucleic fragment 1062-1142 of SEQ ID NO: 12 and to the 

15 protein fragment 275-301 of SEQ ID NO: 13. 

■ With reference to the amino acid sequence of 
wild type human CCR5 (SEQ ID NO: 12) , the present inven- 
tion relates to CCR5 variants which comprise a portion 
having . amino acid residues 1-101 of SEQ ID NO: 13; said 

20 sequence is represented at SEQ ID NO: 14 and SEQ ID NO: 15 
and spans the first two transmembrane domains of wild 
type CCR5 ; it therefore lacks the transmembrane domains 
3-7 as specified hereabove. 

Accordingly, the present invention provides 

25 for a purified CCR5 variant protein which comprises the 
first two transmembrane domains of wild type CCR5 but not 
transmembrane domains 3, 4 , 5, 6, and 7 (SEQ ID NO: 14 and 
SEQ ID NO: 15) . The CCR5 variant may further comprise 
additional heterologous amino acids as a fusion protein. 

30 The present invention also provides for 

nucleic acid molecules encoding such CCR5 variant 
proteins. Such nucleic acids may themselves be truncated 
or may comprise a termination codon at positions 54 0/ 
with reference to SEQ ID NO: 12. 
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In a specific non-limiting embodiment, the 
present invention provides for a purified protein which 
is the CCR5 variant CCR5m303, having an amino acid 
sequence as set forth in SEQ ID NO: 15 and for fusion 
5 proteins comprising the CCR5m303 variant joined to a 
heterologous protein sequence. The present invention also 
provides for purified and isolated nucleic acid molecules 
encoding such CCR5 variant proteins, including a nucleic 
acid having a sequence as set forth in SEQ ID NO: 14. 

10 The proteins of the invention may be prepared 

by synthetic techniques (Merrifield method, for 
instance), by cleavage of naturally derived CCR5 or, - 
preferably, by recombinant techniques. 

The' present invention also includes nucleotide 

15 fragments, in particular oligonucleotides, derived from 
nucleotide sequences such as defined above, and in 
particular the following oligonucleotides, which are. 
useful as primers : 

- SEQ ID NO:l (A32. sens) : GTCTTCATTA CACCTGCAGC TC 
20 - SEQ ID NO: 2 : (A32 inv) : GTGAAGATAA GCCTCACAGC C 

- SEQ ID NO: 3: (SE) : CCCAAGCTTA TGGATTATCA A 

- SEQ ID NO: 4 : (ASE) : GCTCTAGATC ACAAGCCCAC AGA 

- SEQ ID NO: 5 (SI): GGGCAACTAA ATACAT 

- SEQ ID NO: 6: (S2) : GCACAACTCT GACTGG 

25 - SEQ ID NO:7: (mDl) : TCC TTC TTA CTG TCC CCT TCT GG 

- SEQ ID NO:8: (mD2) : CCT GTG CCT CTT CTT CTC ATT TC 

The position of these different primers on the 
gene sequence are the following: 
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SEQ ID NO: Primer Position on the 

gene sequence 

1 D32sens 757 

- 2 D32inv 955 

5 3 SE** 240 

cccaagcttatggattatcaagtg 
bold sequence=CCR5 sequence 

4 ASE* 1298 

gctctagatcacaagcccacaga 
10 bold sequence=CCR5 sequence 

5 SI 173 

6 S2 1338 

7 mdl 475 

8 md2 - 935 

15 

**SE primer includes a restriction site Hindlll 
* ASE primer includes a restriction site XbAI . 

The present invention also includes nucleotide 
fragments, useful as primers, which comprises between 15. 
20 and 100 nucleotides of a sequence upstream or downstream 
nucleotide 540 (with reference to the wild type human, 
CCR5 gene) . 

The invention also relates to nucleotide 
fragments which are complementary to those above. 

25 The present invention also includes fragments 

which may be useful as probes ; among such fragments, one 
may cite SEQ ID NO: 10 and SEQ ID NO: 11. 

The nucleic acids of the invention may be 
incorporate into suitable vectors for cloning and/or for 

30 expression, and as such, may be operatively linked to 
appropriate promoter sequences, ribosome binding 
sequences, signal sequences, transcription termination 
sequences polyadenylation sequences, splice 

donor/acceptor sequences etc... Examples of such vectors 

35 include but are not limited to, herpes simplex viral 
based vectors such as pHSVl (Geller et al . , Proc. Natl. 
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Acad. Sci., 1990, 87, 8950-8954); retroviral vectors such 
as MFG (Jaffee et al., Cancer Res. 53, 2221-2226, 1993,), 
and in particular Moloney retroviral vectors such as LN, 
1NSX, LNCX, LXSN (Miller and Rosman, Biotechniques 7, 
5 980-989, 1989); vaccinia viral vectors such as MVA 
(Sutter and Moss, Proc. Natl. Acad. Sci . U.S.A., 89, 
10847-10851, 1992); adenovirus vectors such as pJM17 (Ali 
et al., Gene Therapy 1, 367-384/ 1994; Berker, 
Biotechniques 6, 616-624, 1988; Wand and Finer, Nature 

10 Medicine, 2, 714-716, 1996); adeno-associated virus 
vectors such as AAV/neo (Mura-Cacho et al., J. 
Immunother . 11, 231-237, 1992); lentlvirus vectors 
(Zufferey et al., Nature Biotechnology, 15, 871-875, 
1997); baculovirus expression vectors such as p2Bac, and 

15 plasmid vectors such as pcDNA3 and pcDNAl ( Invitrogen) , 
pET 11a, pET3a, pETlld, pET3d, pET22d, pET12a and pET28a 
(Novagen) ; plasmid AH 5 (which contains the SV4 0 origin' 
and the adenovirus major late promoter), pRC/CMV 
(Invitrogen), pCMUII (Paabo et al., EMBO J. 5, 1921-1927, 

20 1986), pZipNeo SV (Cepko et al., Cell 37, 1053-1062, 
1984), pSRa (DNAX, Palo Alto, CA) and pBK-CMV 7 pSPTg- 
T 2 FpAXK and pSPTg.2FXK (Schaleger et al., Proc. Natl. 
Acad. Sci. U.S.A., 94, 3058-2063, 1997). 

Suitable expression systems include mammalian 

25 cells, insect cells, yeast, bacteria, and plants. 

The present invention also provides for a cell 
into which any of the foregoing nucleic acids has been 
introduced. The cell may be a vertebrate cell such as a 
mammalian cell (including, but not limited to, a human 

30 cell), a bacteria cell, a yeast cell, a plant cell, or an 
insect cell. The nucleic acid may be introduced by trans- 
fection, injection, electroporation, transformation, cell 
fusion, or any other standard technique. 

The CCR5 variants of the invention either 
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prevent or inhibit infection of a cell by a HIV virus 
which requires CCR5 as a co-receptor. The ability of the 
CCR5 variants to block or inhibit infection may be 
confirmed in an Env-mediated cell fusion assay such as 
5 the GD4+ HeLa LTR/lacZ assay described hereafter in the 
examples . 

The present invention provides for a method of 
detecting the presence of a CCR5 variant which comprises 
the first two transmembrane domains (1 and 2) of wild 

10 type CCR5 but which lacks transmembrane domains 3, 4 , 5, 
6 and 7 in an individual, wherein the presence of such a 
variant bears a positive correlation with either a 
resistance to infection by M-tropic strains of HIV-1, a 
lower probability of infection or a slower progression .of 

15 disease if such infection has occurred. 

In particular, a determination that both CCR5 
alleles of an individual encode a CCR5 variant which 
either (i) comprises the first two transmembrane domains 
of wild type CCR5 but which lacks transmembrane domains 

20 3, 4, 5, 6 and 7 or (ii) is functionally defective (e.g., 
cannot be expressed at the cell surface) has a positive 
correlation with a resistance of the individual to 
infection by M-tropic strains of HIV-1 . 

One non-limiting example of a functionally 

25 defective CCR5 variant is the CCR5A32 variant. 

Further, a determination that one CCR5 allele 
of an individual is a wild type CCR5 allele and the other 
allele is a CCRS variant which comprises the first two 
transmembrane domains of wild type CCR5 but which lacks 

30 transmembrane domains 3, 4, 5, 6 and 7 may have a posi- 
tive correlation with a decreased risk of infection, 
relative to homozygously wild type individuals, and with 
slower progression of disease in the individual once 
infection with an M-tropic strain of HIV-1 has occurred. 
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The phrase "slower progression of . disease", as used 
herein, refers to a longer time interval between infec- 
tion and progression to a diagnosis "* of acquired 
immunodeficiency syndrome (AIDS) . 
5 In preferred embodiments', the methods of the 

invention are directed towards detecting the presence of 
the CCR5m303 variant in an individual. 

Non-limiting examples of genotypes having a 
positive correlation with resistance to infection to 
10 macrophage- tropic strains of HIV-1 would, include indi- 
viduals homozygous for CCR5m303, and individuals having 
one CCR5m303 allele and one CCR5A32 allele. 

Non-limiting example of genotypes having a 
positive correlation with a lower probability of infec- 
15 tion with macrophage- tropic strains of HIV-1 or a slower 
progression of disease once' infection had occurred would 
include individuals having one wild type CCR5 allele and 
one CCR5m303 allele. 

The presence of a CCR5 variant, as set forth 
20 above, or of wild type CCR5, may be accomplished by 
detecting the presence of a nucleic acid, or a protein 
which is characteristic of said variant or wild type 
CCR5 . < 

The instant invention also relates to a method 
25 of detecting and identifying the presence of at least a 
CCR5 variant in a biological sample, comprising : 

- bringing into contact the biological sample 
with a pair of primers selected from the group consisting 
of: 

30 SEQ ID NO:l and SEQ ID NO: 2 

SEQ ID NO: 3 and SEQ ID NO: 4 
SEQ ID NO: 5 and SEQ ID NO: 6 
SEQ ID NO: 7 and SEQ ID NO: 8 
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amplifying the CCR5 variant, eventually 
present in said biological sample, 

- demonstrating the amplification of the DNA 
fragment corresponding to the fragment flanked by the 

5 primers, for example by gel electrophoresis, 

- optionally, verifying the sequence of the 
amplified fragment, for example by specific probe 
hybridization, sequencing or restriction site analysis. 

Nucleic acid and amino acid sequences of wild 

10 type CCR5 are set forth herein as SEQ ID NO: 12, wherein 
XXX = TGT and YYY = Cys or SEQ ID NO : 13 . Nucleic acid, and 
amino acid sequences of the CCR5m303 variant are set 
forth herein as SEQ ID NO: 14 or SEQ ID NO: 15. The CCR5A32 
variant is characterized by the nucleic acid set forth in 

15 SEQ ID NO: 16, having a deletion of nucleic acid residues 
[position 793-824] and an amino acid sequence consisting 
essentially of 'residues 1-187 of SEQ ID NO: 13. 

Analysis may be performed using a suitable 
sample collected from the individual, including, but not 

20 limited to, a blood sample, and in particular of PBMC. 

The presence of the CCR5m303 variant .or 
CCR5A32 , variant in an individual may be detected by 
obtaining and sequencing the CCR5 alleles of the indi- 
vidual, or portions thereof spanning the 303 position or 

25 the A32 position, using standard techniques which 
preferably employ amplification . technology, with the 
primers and the probes hereabove defined. 

Using such technology, a biological sample 
collected from a subject to be tested is contacted with a 

30 pair of oligonucleotide CCR5-directed primers, under 
conditions which allow for the hybridization of the 
primers to nucleic acid template in the sample, the 
primers are extended under suitable conditions, 



WO 98/54317 



PCT/EP98/03437 



14 

dissociated from the template, and then re-annealed, 
extended, and dissociated such as to amplify the number 
of copies of CCR5 nucleic acid demarcated by the primers, 
and then the product of amplification may be charac- 
5 terized by electrophoresis, restriction endonuclease 
cleavage patterns, and/or nucleic acid sequencing using 
standard techniques. 

In particular preferred non-limiting embodi- 
ments of the invention, the presence, of the CCR5m303 

10 allele in an individual may be detected by determining 
that a CCR5 allele in the individual lacks of Hindi 
restriction enzyme cleavage site at the 303 position. 

Nucleic acid encoding a CCR5 allele, prepared 
from a subject (e.g., from PBMC) , may be tested for the 

15 presence or absence of this cleavage site, either 
directly, or, preferably, by amplification techniques. 
For example, the entire CCR5 gene sequence, or a portion 
thereof spanning the 303 site, may be amplified using the 
primers specified above. 

20 In one non-limiting specific example, for 

obtaining amplified DNA encoding the entire CCR5 gene,' 
genomic DNA may be amplified using SEQ ID NO: 3 and SEQ ID 
NO: 4 or SEQ ID NO: 5 and SEQ ID NO: 6. ' 

The restriction endonuclease Hindi may then 

25 be added to amplified DNA prepared from the subject being 
tested (and preferably, in parallel, to a corresponding 
DNA sample representing the wild type CCR5 gene), under 
conditions recommended by the enzyme manufacturer for 
cleavage. The reaction product may then be separated to 

30 reveal restriction fragments, for example by electropho- 
resis in a 1 percent agarose gel. The absence of a Hindi 
cleavage site at the 303 position is indicative of the 
presence of the CCR5m303 mutation. Where the entire CCR5 
sequence is analyzed in this matter, the CCR5m303 muta- 
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tion is associated with a single restriction fragment 
approximately 1.1 kb in size, compared with two fragments 
of approximately 0.7 and 0.4 kb in wild type and CCR5A32 
= alleles (the Hindi site between the fragments evidently 
5 being eliminated by the mutation) . 

Identification of the presence of the CCR5A32 
variant, in a specific non-limiting embodiment, may be 
performed using the primers SEQ ID NO: 1 and SEQ ID NO:2 
or SEQ ID NO:7 and SEQ ID NO:8. 

10 The present invention also includes a method 

of rapid detection of the presence of two simultaneous 
variants of the CCR5 gene, wherein the first variant is- a 
CCR5m303 variant and the second variant is a CCR5A32 
variant, wherein it comprises : . 

15 a step in . which the biological sample is 

brought into contact with the pair of primers including 
SEQ ID N0:7 and SEQ ID NO:8 and 

a step in which the product resulting from the 
interaction between the nucleotide sequence of the 

20 CCR5m303 allele and the CCR5A32 allele may be detected by 
any suitable means (probe hybridization, sequencing or 
restriction site analysis) . 

Identification of the foregoing variants may 
also be performed at the protein level, for example, "by 

25 subjecting a protein sample collected from an individual 
and subjecting such protein to Western blot analysis, 
wherein an antibody directed against CCR5 is used to 
identify CCR5 proteins expressed in the individual. The 
appearance of a CCR5 protein having a molecular weight 

30 which is lower than wild type [40.6 kDa], and preferably 
of about [13-21.5 kDa], is indicative of the presence of 
a CCR5 variant having the characteristics set forth 
above . 
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As such, the present invention provides for 
molecules, compositions and kits which may be used in the 
foregoing analysis. Such molecules include but are not 
limited to oligonucleotide molecules which may be used to 
5 detect nucleotide defects in a CCR5 gene which give rise 
to the variants describe above. Such oligonucleotides 
preferably have a length between 8 and 100 and more 
preferably between 18 and 50 bases in length, and may 
optionally be detectably labeled, for example, with 1 a 

10 radioactive or a non-radioactive compound. Specific non- 
limiting embodiments are oligonucleotides comprising the 
sequences. SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:5, SEQ ID 
NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:10 and SEQ ID 
NO:ll, wherein SEQ ID NO:l and SEQ ID NO:2 are particu- 

15 larly useful in detecting the presence of CCR5A32; 
SEQ ID NO: 5 and SEQ ID NO: 6 are useful in amplifying the 
entire CCR5 gene and SEQ ID NO: 7 and SEQ ID NO: 8 are 
useful in detecting the presence of both . CCR5A32 and 
CCR5m303. 

?0 Suitable kits for performing methods of the 

invention may comprise (i) at least a CCR5 oligonucleo- 
tide primer and (ii) a reagent for performing an amplifi- 
cation reaction therewith. The present invention further 
provides for oligonucleotide primers (SEQ ID NO: 7 and 

25 SEQ ID NO: 8) designed to amplify a fragment of the CCR5 ' 
gene spanning the 303 position but not comprising the 
entire gene; the region spanned may preferably be at 
least 0.5 kb in length. 

Because it has been observed that the CCR5m303 

30 variaint has a trans-inhibitory effect on the ability of 
wild type CCR5 to act as a suitable co-receptor for M- 
tropic strains of HIV-1, the present invention provides 
for methods of inhibiting infection of a CCR5-expressing 
cell comprising introducing, into the cell, a CCR5 
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variant which comprises the first two transmembrane 
domains (1 and 2) of wild type CCR5 but which lacks 
transmembrane domain 3, 4, 5, 6 and 7. In preferred 
embodiments, the CCR5 variant is CCR5m303. 
5 The CCR5 variant may be introduced into the 

cell by introducing, into the cell, a nucleic acid 
encoding the CCR5 variant, either by infection with a 
virus containing the CCR5-encoding nucleic acid (such as 
a retrovirus, adenovirus, adeno-associated virus, etc.) 

10 or by injection of nucleic acid into the cell or" 
surrounding tissue. Such introduction may be accomplished 
when the cell is part of a subject to be treated (an in 
vivo approach) or when the cell has been removed from the 
subject (an ex vivo approach) . 

15 As a specific, non-limiting example, nucleic 

acid encoding the CCR5m303 variant may be introduced 
selectively into cells which are CCR5 + and CD4+. In 
further specific non-limiting embodiments, expression of 
the nucleic acid encoding the CCR5 variant may be placed 

20 under the control of an inducible promoter or such that a 
trans-acting sequence is required (for example by an HIV- 
encoded factor, such as tat, so that expression in the- 
cell is not constitutive. 
EXAMPLE 

25 MATERIALS AND METHODS 

Determination of susceptibility of PBMC from 
UU or ExU individuals to infection, and of effects of 
RANTES on infection. 

Cultures of peripheral blood mononuclear cells 
30 (PBMC) collected from two unexposed uninfected (UU) and 
two exposed uninfected (ExU) individuals were inoculated 
with either CCR5-dependent (YU2) or CXCR4 -dependent (NL4- 
3) HIV molecular clones, using a virus inoculum of 20 ng 
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p24/10 6 cells, with or without addition of RANTES (R) , as 
follows. Ficoll purified human PBMC were activated by 1 
pg/ml PHA-P (Glaxo Wellcome, Paris, France) for 72 h in 
RPMI 1640 medium (Gibco/BRL Life Technologies, Cergy, 
5 France) containing 10% fetal calf serum (FCS) and 
infected with HIV-1 containing an amount of p24 antigen 
determined by ELISA (Dupont de Nemours, Wilmington, DL) 
for 2 h. Cells were washed three times with PBS and 

6 

cultured at 10 cells/ml in RPMI 164 0 . medium containing 
10 10% FCS and 20 ng/ml recombinant interleukin 2 (rIL-2) 
(EuroCetus, France) . . 

Every 3 days the culture fluid was harvested 
and replaced with fresh medium containing IL-2 and the 
amount of p24 antigen in the cell-free supernatants was 
15 measured by ELISA. Inhibition of infection by human 
recombinant RANTES (Biodesign International, Kennebunk, 
ME) was performed by adding 125 nmol final concentration 
at the time of infection and each time the medium was. 
replaced. 

20 Functional analysis of both CCR5 alleles 

cloned from ExU2 using an 7Env-mediated cell fusion assay. 

A CD4 + HeLa (lacZ) cell clone was generated 
in HeLa cells carrying a stably integrated lacZ gene 
under HIV-1 LTR control by stable transfection with a 

25 retroviral vector containing CD4 cDNA (P4 HeLa cell 
clone) (19) . Stable surface expression of the HIV-1 LAI 
envelope glycoprotein in human HeLa cells (HeLa LAI 
cells) was obtained by transfection of pMA243, an HIV-1 
provirus derived from an infectious HIV-1 LAI provirus in 

30 which the gag and pol genes are deleted and the nef gene 
is replaced by the cfo.fr drug resistance gene (20) . Stable 
surface expression of the HIV-1 Ada envelope glycoprotein 
in HeLa cells (HeLa Ada cells) was obtained by substi- 
tuting the HIV-1 LAI envelope glycoprotein by the HIV-1 
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Ada envelope glycoprotein in pMA243 (21) . HeLa cells 
lacking HIV-1 env expression (HeLa Aenv ceils) were 
obtained by transfection of pMA273, an env defective HIV- 
1 provirus (20) . CD4 + HeLa (LTR lacZ) cells were 
5 transiently transfected with pcDNA3-based constructs 
permitting the expression of either CCR5 wild type 
(CCRSwt), m303 (CCR5m303) or A32 (CCR5A32 ) cDNA. After 24 
h, cells were cocultured with HeLa Ada, HeLa LAI or HeLa 
Aenv cells. Cell fusion was evaluated after 24 h by 
10 measuring p-galactosidase activity in cell lysates and 
assessed as described in Oberlin et al., Nature, 1996, 
382, 833. 

Genetic analysis of CCR5 from two UU indi- 
viduals , two ExU individuals , and two siblings of ExU2 . 

15 The A32 fragment was amplified by PCR from 

PBMC genomic DNA using the pair of primers corresponding 
to SEQ ID NO:l and SEQ ID NO: 2. PCR was conducted with 1 
lag of genomic, DNA using 0.2 mM dNTPs, 0.2 pM primers, and 
1.25 U of AmpliTAq Gold polymerase (PE Applied 

20 Biosystems, Branchburg, NJ) for 35 cycles (94°C, 40 sec ; 
60°C, 40 sec ; 72°C, 40 sec) after an initial 10 min 
denaturation at 94 °C. The resulting PCR products were 
separated on a 2% nusieve agarose gel. Two fragments of 
198 bp and 166 bp corresponding to the wild type and the 

25 deleted CCR5 alleles were obtained. 

The entire CCR5 gene may be amplified : 
- either with the pair of primers SEQ ID NO: 5 
and SEQ ID NO: 6; the amplified product may be used for 
sequencing said gene ; 

30 - or with the pair of primers SEQ ID NO: 3 and 

SEQ ID NO: 4; in such a case, the amplified product may be 
used to cloning and sequencing of the gene. 

PCR of genomic and plasmid DNA (200 ng) was 
conducted as above for 30 cycles (94°C, 1 min30 ; 4 9°C 

35 (with pair of primers SEQ ID NO:5 and SEQ ID NO:6/55°C 
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(with the pair of primers SEQ ID NO:3 and SEQ ID NO: 4), 1 
min ; 72°C, 1 min30) after an initial 10 min denaturation 
''at 94°C. The amplified products were cloned in a pCR3 
vector using the TA cloning kit (Invitrogen, Leek, 

5 Netherlands) . Automatic sequencing was performed using 
the same primers (ESGS, Research and Development 
department, Evry, France). For analysis of restriction 
enzyme products of different CCR5 alleles, the entire 
CCR5 cDNA was amplified using the same set of primers as 

10 above, digested with Hindi, and the fragments separated 
on an agarose gel. 



variant and the CCR5m303 variant may be carried out by 
amplifying the gene with the pair of primers SEQ ID NO: 7 

15 and SEQ ID NO: 8. PCR was conducted with 1 ^ig of genomic 
ADN using 0.2 mM dNTPs, 0.2 primers, and 1.25 U of 

AmpliTAq Gold polymerase (PE Applied Biosystems, 
Branchburg, NJ) for 35 cycles (94°C, 40 sec; 56,5°C, 1 
min; 72°C, 40 sec) after an initial 10 min denaturation 

20 at 94 °C. The resulting PCR products were separated on a 
2% Nusieve agarose gel. For analysis of restriction* 
enzyme, the resulting PCR products are digested with 
Hindi, and the fragments separated on an agarose gel. 



The simultaneous 



detection of 



the 



CCR5A32 



The sizes of the obtained products are as 



25 



follows 



gene : 



wt 



4 60 bp 



A32 



428 bp 



m303 



4 60 bp 



30 



after Hindi restriction 



wt: 66 and 394 bp 
A32: 66 and 362 bp 



m303: 4 60 bp. 
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Infection of PBMC from family members of ExU2 
with CCR5 -dependent (YU2) and CXCR4 -dependent (NL4-3) 
viruses . 

Ficoll purified PBMC were activated as above 
5 for 4 days and infected after 1 day with 0.5 ng p24/lQ 6 
cells of either YU2 or NL4-3 HIV molecular clones in RPMI 
1640 medium supplemented with 10% FCS, 1 U/ml anti-alFN 
(Valbiotech, Paris, France) and 2 pg/ml polybrene (Sigma, 
France) . 

10 RESULTS AND DISCUSSION 

ExUl and ExU2 are two causasian homosexual men 
who reported multiple incidents of unprotected sexual 
intercourse during the last ten years. ExU2 reported 
numerous sexual relationships with multiple partners who 

15 succumbed to AIDS and is now the stable partner of an 
HIV-infected person. No evidence of HIV-1 infection was 
detected in either ExU individual by standard techniques 
(HIV-1 ELISA and RNA. PCR, Roche), nor was there evidence 
of clinical or immune alterations as determined by the 

20 CQ4/CD8 ratio and other parameters of cellular immunity. 

The CCR5 genotypes of ExUl and ExU2 were first 
determined by PCR of genomic DNA using a pair of primers 
which allow amplification of the DNA fragment containing 
the characteristic A32 base pair deleted sequence of 

25 CCR5A32 (2). ExUl was found to be homozygous for the 
deleted allele (A/A), while ExU2 displayed only one 
deleted allele (wt?/A) , where wt? indicates that one 
allele of CCR5 in this individual lacked the 32 base pair 
deletion and was therefore presumed to be wild type. 

30 There is evidence for an association between 

CCR5A32 homozygoty and resistance to infection (1,2) 
whereas the same deletion, when present as a heterozygous 
trait, does not confer resistance (3,11). Cells of 



WO 98/54317 



PCT7EP98/03437 



22 

CCR5A32 heterozygous individuals (like ExU2) are 
typically susceptible to infection by CCR5-dependent 
viruses (1) . 

To determine whether the presumably exposed 
5 but uninfected individuals were resistant to infection, 
PBMC from both individuals (ExUl and ExU2) were tested 
for their ability to be infected by two molecular clones 
of HIV-1: YU2, which uses CCR5 as a co-receptor (12) and 
■NL4-3, which is strictly CXCR4-dependent (13) . PBMC from 

10 two uninfected unexposed (UU) individuals who where 
either homozygous (wt/wt) for the wild type allele (UU1) 
or heterozygous (wt/A) for the deleted allele (UU2) were 
used as controls. 

As expected, PBMC from all four individuals 

15 were readily infected by NL4-3 (Fig. 1A-D) . In contrast, 
both ExUl and ExU2 were completely resistant to infection 
by YU2 (Fig. 1C and D) while both UU1 and UU2 were 
susceptible (Fig. 1A and B) . Infection of both UU with 
YU2 could be inhibited by the HIV-suppressive chemokine 

20 RANTES, a ligand of CCR5 (Fig. 1A and B) . 

The resistance of ExU2 PBMC to infection by 
CCR5-dependent viruses was not restricted to the YU2 
molecular clone, as inoculation of PBMC from ExU2 with 
either two additional CCR5-dependent HIV viruses (JRCSF 

25 and BaL) as well as the HIV virus (V164) isolated from 
the seropositive partner of ExU2, (which, based* on 
susceptibility to neutralization by RANTES is CCR5- 
dependent) did not result in infection. ExU2 PBMC were 
equally resistant to the three HIV-1 isolates as well as 

30 to the clinical isolate Vl64 . In contrast, the control 
UU2 PBMC were infected by all viruses tested. Therefore, 
despite the fact that ExU2 was apparently heterozygous 
for CCR5A32, he was resistant to infection by CCR5- 
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dependent HIV-1. 

The resistance of ExU2 PBMC to in vitro infec- 
tion by CCR5-dependent viruses, together with his healthy 
clinical status despite an extensive history of sexual 
5 exposure, prompted the search for an alternative mutation 
within the non-deleted CCR5 allele. Therefore, the entire 
CCR5 gene of ExU2 was cloned and the nucleotide sequence 
of both alleles was determined. The presence of the A32 
deletion in one of the two alleles was confirmed. In 

10 addition, a single point mutation (T-->A) at position 303 
was found in the' non-deleted allele (thereafter called 
m303) . The remainder of the sequence (SEQ ID NO:14) was 
otherwise identical to the wild type gene (SEQ ID NO: 12) 
(14) . The CCR5m303 mutation generates a stop coddn which 

15 truncates the CCR5 at position 303, resulting in the loss 
of the transmembrane regions 3-7 and the C-terminal 
cytoplasmic end of the molecule. 

The CCR5m303 mutation together with the A32 
deletion could account for the incapacity of macrophage- 

20 tropic HIV isolates to infect ExU2 PBMC if both proteins 
expressed by the two alleles were non-functional. To 
expose this hypothesis, the wild type CCR5 and the two 

mutant alleles were each transfected into CD4 human cells 

and tested for their capacity to generate a functional 

25 protein in an HIV envelope-mediated cell fusion assay 
+ 

using CD4 HeLa cells carrying an integrated HIV long 
terminal repeat (LTR) -driven reporter gene (lacZ) . Env- 
mediated cell fusion was assessed by measuring p- 
galactosidase generated from lacZ when HeLa cells trans- 
30 fected with defective provirus but expressing an HIV env 
gene were combined with CD4+, LTR/lacZ-containing HeLa 
cells transfected with nucleic acid encoding wild type 
CCR5, the CCR5m303 variant or the CCR5A32 variant. If 
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cell fusion via a . . CD4 + . CCR5 . env interaction occurs, 
provirus encoded Tat protein from the env-bearing cells 
can activate the LTR sequence in the CCRS-bearing cells 
and p-galactosidase expression will occur. The amount of 

5 P-galactosidase generated in HeLa CD4 cell depends on the 
induction of the HIV-1 LTR by the Tat protein and is an 
accurate measurement, of £nv-mediated cell fusion. 

As shown in Fig, 2, neither the CCR5m303 or 
the CCR5A32 variant were capable of generating a f unc- 
10 tional, fusion-permissive co-receptor , although wild type 
CCR5 -expressing CD4+ HeLa cells resulted in cell fusion. 
These results suggest that the resistance to infection of 
ExU2 was due to the absence of a functional CCR5 co- 
receptor. 

15 Next, it was tested whether the CCR5m303 

mutant was a de novo mutation that arose in ExU2 or 
rather was inherited as a Mendelian trait. CCR5 genetic 
analysis and PBMC infection assays were performed in two 
immediate relatives of ExU2, the father (2 . F) and his 

20 sister (2 . S) . When tested for the presence of the A3 2 
mutation, it was . found that both father and sister were 
heterozygous for this deletion (Fig. 3B) . 

Next, we took advantage of the presence of a 
Hindi restriction site at position 303 which is lost by 

25 the T — > A mutation, but present in the wild type CCR5 
gene {Fig. 3A> to screen genomic DNA from all individuals 
in this study. The entire CCR5 genomic sequences 
corresponding to ExU2, 2.S, 2.F, ExUl, UU1 and UU2 were 
amplified by PCR. Figure 3C lanes 1-6 shows the uncleaved 

30 amplified, DNA prepared from these six individuals, 
separated on a 1% agarose gel; all alleles showed similar 
patterns of migration. However, digestion with the 
restriction enzyme Hindi generated different restriction 
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patterns in the CCR5 alleles. Two distincts fragments of 
0.7 and 0.4 kb were obtained from the genomic DNA of 
subjects who were either wild type (UU1), homozygous 
* (ExUl) or heterozygous (UU2) for the A32 allele (Fig. 3D, 
5 lanes 4, 5 and 6), In contrast, the pattern of ExU2, 
which carries the CCR5m303 mutation on one allele, 
displayed an additional band migrating with an apparent - 
size of 1,1 kb (Fig. 3D, lane 1). 

To confirm these findings, we analyzed cDNAs 

10 corresponding to CCR5 wild type, CCR5A32 and CCR5m303 
alleles which had been characterized by nucleotide 
sequencing and cloned in a pCR3 . 1 plasmid. All of the 
cDNA products displayed the same restriction enzyme 
characteristics and migration pattern as those observed 

15 for CCR5 alleles directly amplified from genomic DNA 
(Fig. 3C, lanes 7, 8 and 9). Hindi digestion of the 
three cloned genes generated two distinct fragments of 
0,7 and 0,3 kb only within the wild type and CCR5A32 
clones (Fig. 3D, lanes 7 and 8). In contrast, the, 

20 CCR5m303 clone was not digested and displayed the; origi- 
nal migration pattern of the entire CCR5 gene (1.1 kb) 
(Fig. 3D, lane 9) . Upon digestion with Hindi, genomic 
DNA from 2.S but not 2.F, displayed a migration pattern 
" identical to that of ExU2 (Fig. 3D, lanes 2 and 3), 

25 suggesting that the CCR5m303 mutation was inherited by 
both siblings from the mother as a single mendelian 
trait. The presence of the A32 and m303 alleles on cDNA 
from 2.S was confirmed by automatic nucleotide sequencing 
of the corresponding clones. 

30 Since both ExU2 and 2.S carry the same CCR5 

genetic trait, i.e., a combination of the CCR5m303 mutant 
and CCR5A32 deleted alleles, we compared their in vitro 
resistance to infection by a CCR5-dependent virus ( YU2 ) . 
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In addition, we tested PBMC from 2.F, which carry only 
the CCR5A32 deleted allele along with the wild type one. 
As would be expected based upon the lack of functionality 
of the two mutant alleles, 2.S PBMC displayed the same 

5 degree of resistance to infection as ExU2 PBMC whereas 
the 2.F PBMC were fully susceptible to infection with YU2 
(Fig. 3A) . All three individuals were susceptible to in 
vitro infection with NL4-3, the CXCR4-dependent virus 
(Fig. 3B) . 

10 Figure 5A-B. 

Figure 5A Hela P4 cells were cotransf ected 
with different plasmid containing either CCR5 wild type 
(wt) , CCR5A32 (A) or CCR5m303 (m) , with DNA ratio (1/1; 
3/ig/3/ig ) . After 48h, an £nv-mediated cell fusion assay 

15 was conducted with HeLa Ada cells. A £ galactosidase 
assay was realised and the results expressed by the 
.number of blue cells. The results illustrated in said 
Figure 5A suggest that the negative interference of the 
mutant CCR5m303 is more effective than the CCR5A32 

20 mutant . 

Figure 5B : U373 cells were cotransf ected with 
different plasmid containing either CCR5 wild type (wt) , 
CCR5A32 (A) or CCR5m303 (m) , with various DNA ratio (1/1, 
2/xg/2/xg; 1/3, 2ng/6pig; 3/1, 6fig/2{zg) . After 48h, a cell 

25 fusion assay was conducted with HeLa Ada cells. 24 hours 
later a CPRG lysis test assay was conducted and the 
results are expressed with an OD value at 540nm. The 
experience show that with CCR5m303 whatever the ratio the 
same negative interference was observed. On the contrary, 

30 a higher amount of wild type plasmid could suppressed the 
negative interference of the CCR5A32. It seems that the 
negative interference observed with the CCR5A32 is dose 
dependent which is not the case for the CCR5m303. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 
(i) APPLICANT: 

(A) NAME: FONDATION MONDIALE RECHERCHE ET PREVENTION SIDA 

(B) STREET: 1 RUE MIOLLIS 

(C) CITY: PARIS 

(E) COUNTRY: FRANCE 

(F) POSTAL CODE (ZIP) : 75732 CEDEX 15 

(ii) TITLE OF INVENTION: HUMAN IMMUNODEFICIENCY VIRUS CO-RECEPTOR 
VARIANTS ASSOCIATED WITH RESISTANCE TO VIRUS INFECTION. 

(iii) NUMBER OF SEQUENCES: 17 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE : Patentln Release #1.0, Version #1.30 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(2) INFORMATION FOR SEQ ID NO : 2: 



22 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "PRIMER " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
GTCTTCATTA CACCTGCAGC TC 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "PRIMER 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
GTGAAGATAA GCCTCACAGC C 
(2) INFORMATION FOR SEQ ID NO : 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = " PRIMER" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
CCCAAGCTTA TGGATTATCA AGTG 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc » "PRIMER" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GCTCTAGATC ACAAGCCCAC AGA 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "PRIMER" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GGGCAACTAA ATACAT 
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(2) INFORMATION FOR SEQ ID NO : 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "PRIMER" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GCACAACTCT GACTGG 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "PRIMER" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
TCCTTCTTAC TGTCCCCTTC TGG 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc ^"PRIMER" 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 8: 
CCTGTGCCTC TTCTTCTCAT TTC 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

AATACAATGT GTCAACTCTT G 21 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
(A> LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
AATACAATGT GACAACTCTT G 21 , ' 

(2) INFORMATION FOR SEQ ID NO : 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 461 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

TCCTTCTTAC TGTCCCCTTC TGGGCTCACT ATGCTGCCGC CCAGTGGGAC TTTGGAAATA 60 

CAATGTGACA ACTCTTGACA GGGCTCTATT TTATAGGCTT CTTCTCTGGA ATCTTCTTCA 120 

TCATCCTCCT GACAATCGAT AGGTACCTGG CTGTCGTCCA TGCTGTGTTT GCTTTAAAAG 180 

CCAGGACGGT CACCTTTGGG GTGGTGACAA GTGTGATCAC TTGGGTGGTG GCTGTGTTTG 240 

CGTCTCTCCC AGGAATCATC TTTACCAGAT CTCAAAAAGA AGGTCTTCAT TACACCTGCA 300 

GCTCTCATTT TCCATACAGT CAGTATCAAT TCTGGAAGAA TTTCCAGACA TTAAAGATAG 360 

TCATCTTGGG GCTGGTCCTG CCGCTGCTTG TCATGGTCAT CTGCTACTCG GGAATCCTAA 420 

AAACTCTGCT TCGGTGTCGA AATGAGAAGA AGAGGCACAG G 461 
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(2) INFORMATION FOR SEQ ID NO: 12: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 76 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/ KEY: CDS 

<B) LOCATION: 240. .12 98 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

GAATTCCCCC AACAGAGCCA AGCTCTCCAT CTAGTGGACA GGGAAG CTAG CAGCAAACCT 60 

TCCCTTCACT ACAAAACTTC ATTGCTTGGC CAAAAAGAGA GTTAATTCAA TGTAGACATC 120 

TATGTAGGCA ATTAAAAACC TATTGATGTA TAAAACAGTT TGCATTCATG GAGGGCAACT 180 

AAATACATTC TAGGACTTTA TAAAAGATCA CTTTTTATTT ATGCACAGGG TGGAACAAG 239 

ATG GAT TAT CAA GTG TCA AGT CCA ATC TAT GAC ATC AAT TAT TAT ACA 287 
Met Asp Tyr Gin Val Ser Ser Pro He Tyr Asp He Asn Tyr Tyr Thr 
1 5 10 15 

TCG GAG CCC TGC CAA AAA ATC AAT GTG AAG CAA ATC GCA GCC CGC CTC 335 
Ser Glut Pro Cys Gin Lys He Asn Val Lys Gin He Ala Ala Arg Leu 
20 25 30 

CTG CCT CCG CTC TAC TCA CTG GTG TTC ATC TTT GGT TTT GTG GGC AAC 383 
Leu Pro Pro Leu Tyr Ser Leu Val Phe He Phe Gly Phe Val Gly Asn 
35 40 45 

ATG CTG GTC ATC CTC ATC CTG ATA AAC TGC AAA AGG CTG AAG AGC ATG 431 
Met Leu Val He Leu He Leu lie Asn Cys Lys Arg Leu Lys Ser Met 
50 55 60 

ACT GAC ATC TAC CTG CTC AAC CTG GCC ATC TCT GAC CTG TTT TTC CTT 479 
Thr Asp He Tyr Leu Leu Asn Leu Ala He Ser Asp Leu Phe Phe Leu 
65 70 75 80 

CTT ACT GTC CCC TTC TGG GCT CAC TAT GCT GCC GCC CAG TGG GAC TTT 527 
Leu Thr Val Pro Phe Trp Ala His Tyr Ala Ala Ala Gin Trp Asp Phe 
85 90 95 



GGA AAT ACA ATG XXX CAA CTC TTG ACA GGG CTC TAT TTT ATA GGC TTC 
Gly Asn Thr Met YYY Gin Leu Leu Thr Gly Leu Tyr Phe He Gly Phe 
100 105 110 



575 
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TTC TCT GGA ATC TTC TTC ATC ATC CTC CTG ACA ATC GAT AGG TAG CTG 623 
Phe Ser Gly lie Phe Phe lie lie Leu Leu Thr lie Asp Arg tyr Leu 
115 120 125 

GGT GTC GTC GAT GCT GTG TTT GCT TTA AAA GCG AGG ACG GTC ACC TTT 671 
Ala Val Val His Ala Val Phe Ala Leu Lys Ala Arg Thr Val Thr Phe 
130 135 140 

GGG GTG GTG ACA AGT GTG ATC ACT TGG GTG GTG GCT GTG TTT GCG TCT 719 
Gly Val Val Thr Ser Val lie Thr Trp Val Val Ala Val Phe Ala Ser 
145 150 155 160 

CTC CCA GGA ATC ATC TTT ACC AGA TCT CAA AAA GAA GGT CTT CAT TAC 767 
Leu Pro Gly He lie Phe Thr Arg Ser Gin Lys Glu Gly Leu His Tyr 
165 170 175 

ACC TGC AGC TCT CAT TTT CCA TAC AGT CAG TAT CAA TTC TGG AAG AAT 815 
Thr Cys Ser Ser His Phe Pro Tyr Ser Gin Tyr Gin Phe Trp Lys Asn 
180 185 190 

TTC CAG ACA TTA AAG ATA GTC ATC TTG GGG CTG GTC CTG CCG CTG CTT . 863 
Phe Gin Thr Leu Lys lie Val He Leu Gly Leu Val Leu Pro Leu Leu 
195 200 205 

GTC ATG GTC ATC TGC TAC TCG GGA ATC CTA AAA ACT CTG CTT CGG TGT 911 
Val Met Val He Cys Tyr Ser Gly lie Leu Lys Thr Leu Leu Arg Cys 
210 215 220 

CGA AAT GAG AAG AAG AGG CAC AGG GCT GTG AGG CTT ATC TTC ACC ATC .959 
Arg Asn Glu Lys Lys Arg His Arg Ala Val Arg Leu lie Phe Thr He 
225 230 235 240 . 

ATG ATT GTT TAT TTT CTC TTC TGG GCT CCC TAC AAC ATT GTC CTT CTC 1007 
Met lie Val Tyr Phe Leu Phe Trp Ala Pro Tyr Asn He Val Leu Leu 
245 250 255 

CTG AAC ACC TTC CAG GAA TTC TTT GGC CTG AAT AAT TGC AGT AGC TCT 1055 
Leu Asn Thr Phe Gin Glu Phe Phe Gly Leu Asn Asn Cys Ser Ser Ser 
260 265 270 

AAC AGG TTG GAC CAA GCT ATG CAG GTG ACA GAG ACT CTT GGG ATG ACG 1103 
Asn Arg Leu Asp Gin Ala Met Gin Val Thr Glu Thr Leu Gly Met Thr 
275 280 285 

CAC TGC TGC ATC AAC CCC ATC ATC TAT GCC TTT GTC GGG GAG AAG TTC 1151 
His Cys Cys lie Asn Pro lie He Tyr Ala Phe Val Gly Glu Lys Phe 
290 295 300 



AGA AAC TAC CTC TTA GTC TTC TTC CAA AAG CAC ATT GCC AAA CGC TTC 1199 
Arg Asn Tyr Leu Leu Val Phe Phe Gin Lys His lie Ala Lys Arg Phe 
305 310 315 320 
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TGC AAA TGC TGT TCT ATT TTC CAG GAA GAG GCT GCC GAG CGA GCA AGC 124 7 
Cys Lys Cys Cys Ser lie Phe Gin Gin Glu Ala Pro Glu Arg Ala Ser 
325 330 335 

TCA GTT TAC ACC CGA TCC ACT GGG GAG CAG GAA ATA TCT GTG GGC TTG 1295 
Ser Val Tyr Thr Arg Ser Thr Gly Glu Gin Glu He Ser Val Gly Leu 
340 345 350 

TGA CACGGACTCA AGTGGGCTGG TGACCCAGTC AGAGTTGTGC ACATGGCTTA 1348 



GTTTTCATAC ACAGCCTGGG CTGGGGGT 1376 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 353 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Asp Tyr Gin Val Ser Ser Pro lie Tyr Asp He Asn Tyr Tyr Thr 
1 5 10 15 

Ser Glu Pro Cys Gin Lys He Asn Val Lys Gin He Ala Ala Arg Leu 
20 25 30 

Leu Pro Pro Leu Tyr Ser Leu Val Phe He Phe Gly Phe Val Gly Asn 
35 40 45 

Met Leu Val He Leu He Leu lie Asn Cys Lys Arg Leu Lys Ser Met 
50 . 55 60 

Thr Asp He Tyr Leu Leu Asn Leu Ala He Ser Asp Leu Phe Phe Leu 
65 70 75 80 

Leu Thr Val Pro Phe Trp Ala. His Tyr Ala Ala Ala Gin Trp Asp Phe 
85 90 95 

Gly Asn Thr Met Cys Gin Leu Leu Thr Gly Leu Tyr Phe He Gly Phe 
100 105 110 

Phe Ser Gly He Phe Phe He He Leu Leu Thr He Asp Arg Tyr Leu 
115 120 125 

Ala Val Val His Ala Val Phe Ala Leu Lys Ala Arg Thr Val Thr Phe 
130 135 140 



Gly Val Val Thr Ser Val He Thr Trp Val 
145 150 



Val Ala Val Phe Ala Ser 
155 160 
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Leu Pro Gly lie lie Phe Thr Arg Ser Gin Lys Glu Gly Leu His Tyr 

165 170 175 

Thr Cys Ser Ser His Phe Pro Tyr Ser Gin Tyr Gin Phe Trp Lys Asn 

180 185 * 190 

Phe Gin Thr Leu Lys He Val He Leu Gly Leu Val Leu Pro Leu Leu 

195 200 205 

Val Met Val He Cys Tyr Ser Gly He Leu Lys Thr Leu Leu Arg Cys 

210 215 220 

Arg Asn Glu Lys Lys Arg His Arg Ala Val Arg Leu He Phe Thr He 

225 230 235 240 

Met He Val Tyr Phe Leu Phe Trp Ala Pro Tyr Asn He Val Leu Leu 

245 250 255 

Leu Asn Thr Phe Gin Glu Phe Phe Gly Leu Asn Asn Cys Ser Ser Ser 

260 265 270 

Asn Arg Leu Asp Gin Ala Met Gin Val Thr Glu Thr Leu Gly Met Thr 

275 280 285 

His Cys Cys He Asn Pro He He Tyr Ala Phe Val Gly Glu Lys Phe 

290 295 300 

Arg Asn Tyr Leu Leu Val Phe Phe Gin Lys His He Ala Lys Arg Phe 

305 310 315 320 

Cys Lys Cys Cys Ser He Phe Gin Gin Glu Ala Pro Glu Arg Ala Ser 

325 330 335 

Ser Val Tyr Thr Arg Ser Thr Gly Glu Gin Glu lie Ser Val Gly Leu 

340 345 350 



(?) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1071 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 7. .309 



1 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

AAGCTT ATG GAT TAT CAA GTG TCA AGT CCA ATC TAT GAC ATC AAT TAT 48 
Met Asp Tyr Gin Val Ser Ser Pro lie Tyr Asp lie Asn^ Tyr 
355 360 365 

TAT ACA TCG GAG CCC TGC CAA AAA ATC AAT GTG AAG CAA ATC GCA GCC 96 
Tyr Thr Ser Glu Pro Cys Gin Lys lie Asn Val Lys Gin lie Ala Ala 
370 375 380 

CGC CTC CTG CCT CCG CTC TAC TCA CTG GTG TTC ATC TTT GGT TTT GTG 144 
Arg Leu Leu Pro Pro Leu Tyr Ser Leu Val Phe lie Phe Gly Phe Val 
385 390 395 

GGC AAC ATG CTG GTC ATC CTC ATC CTG ATA AAC TGC AAA AGG CTG AAG 192 
Gly Asn Met Leu Val lie Leu lie Leu lie Asn Cys Lys Arg Leu Lys 
400 405 410 415 

AGC ATG ACT GAC ATC TAC CTG CTC AAC CTG GCC ATC TCT GAC CTG TTT 240 
Ser Met Thr Asp lie Tyr Leu Leu Asn Leu Ala lie Ser Asp Leu Phe 
420 425 430 

TTC CTT CTT ACT GTC CCC TTC TGG GCT CAC TAT GCT GCC GCC CAG TGG 288 
Phe Leu Leu Thr Val Pro Phe Trp Ala His Tyr Ala Ala Ala Gin Trp 
435 440 445 

GAC TTT GGA AAT ACA ATG TGA CAACTCTTGA CAGGGCTCTA TTTTATAGGC 33 9 
Asp Phe Gly Asn Thr Met * 
450 

TTCTTCTCTG GAATCTTCTT CATCATCCTC CTGACAATCG ATAGGTACCT GGCTGTCGTC „ 39? 

CATGCTGTGT TTGCTTTAAA AGCCAGGACG GTCACCTTTG GGGTGGTGAC AAGTGTGATC -459 

ACTTGGGTGG TGGCTGTGTT TGCGTCTCTC CCAGGAATCA TCTTTACCAG ATCTCAAAAA 519 

GAAGGTCTTC ATTACACCTG CAGCTCTCAT TTTCCATACA GTCAGTATCA ATTCTGGAAG 579 

AATTTCCAGA CATTAAAGAT AGTCATCTTG GGGCTGGTCC TGCCGCTGCT TGTCATGGTC 63 9 

ATCTG CTACT CGGGAATCCT AAAAACTCTG CTTCGGTGTC GAAATGAGAA GAAGAGGCAC 6 99 

AGGGCTGTGA GGCTTATCTT CACCATCATG ATTGTTTATT TTCTCTTCTG GGCTCCCTAC 759 

AACATTGTCC TTCTCCTGAA CACCTTCCAG GAATTCTTTG GCCTGAATAA TTGCAGTAGC 819 

TCTAACAGGT TGGACCAAGC TATGCAGGTG ACAGAGACTC TTGGGATGAC GCACTGCTGC 879 

ATCAACCCCA TCATCTATGC CTTTGTCGGG GAGAAGTTCA GAAACTACCT CTTAGTCTTC 939 

TTCCAAAAGC ACATTGCCAA ACGCTTCTGC AAATGCTGTT CTATTTTCCA GCAAGAGGCT 999 
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CCCGAGCGAG CAAGCTCAGT TTACACCCGA TCCACTGGGG AGCAGGAAAT ATCTGTGGGC 1059 
TTGTGATCTA GA 1071 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Met Asp Tyr Gin Val Ser Ser Pro lie Tyr Asp lie Asn Tyr Tyr Thr 
15 10 15 



Ser Glu Pro Cys Gin Lys lie Asn Val Lys Gin lie Ala Ala Arg Leu 
20 25 30 

Leu Pro Pro Leu Tyr Ser Leu Val Phe lie Phe Qly Phe Val Gly Asn 
35 40 45 

Met Leu Val He Leu He Leu He Asn Cys Lys Arg Leu Lys Ser Met 
50 55 60 

Thr Asp He Tyr Leu Leu Asn Leu Ala He Ser Asp Leu Phe Phe Leu 
65 70 ' 75 80 

Leu Thr Val Pro Phe Trp Ala His Tyr Ala Ala Ala Gin Trp Asp Phe 
85 90 95 

Gly Asn Thr Met * 
100 



(2) INFORMATION FOR SEQ ID NO : 16: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1344 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(ix) 



FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 240. .887 



WO 98/54317 



PCIYEP98/03437 



38 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

GAATTCCCCC AACAGAGCCA AGCTCTCCAT CTAGTGGACA GGGAAGCTAG CAGCAAACCT 60 

TCCCTTCACT ACAAAACTTC ATTGCTTGGC CAAAAAGAGA GTTAATTCAA TGTAGACATC 120 

TATGTAGGCA ATTAAAAACC TATTGATGTA TAAAACAGTT TGCATTCATG GAGGGCAACT 180 

AAATACATTC TAGGACTTTA TAAAAGATCA CTTTTTATTT ATGCACAGGG TGGAACAAG 239 

ATG GAT TAT CAA GTG TCA AGT CCA ATC TAT GAC ATC AAT TAT TAT ACA 287 
Met Asp Tyr Gin Val Ser Ser Pro lie Tyr Asp lie Asn Tyr Tyr Thr 
105 110 115 

TCG GAG CCC TGC CAA AAA ATC AAT GTG AAG CAA ATC GCA GCC CGC CTC 335 
Ser Glu Pro Cys Gin Lys lie Asn Val Lys Gin lie Ala Ala Arg Leu 
120 125 130 

CTG CCT CCG CTC TAC TCA CTG GTG TTC ATC TTT GGT TTT GTG GGC AAC 383 
Leu Pro Pro Leu Tyr Ser Leu Val Phe lie Phe Gly Phe Val Gly Asn 
135 140 145 

ATG CTG GTC ATC CTC ATC CTG ATA AAC TGC AAA AGG CTG AAG AGC ATG 431 
Met Leu Val lie Leu lie Leu lie Asn Cys Lys Arg Leu Lys Ser Met 
150 155 160 165 

ACT GAC ATC TAC CTG CTC AAC CTG GCC ATC TCT GAC CTG TTT TTC CTT 479 
Thr Asp lie Tyr Leu Leu Asn Leu Ala lie Ser Asp Leu Phe Phe Leu 
7 170 175 180 

CTT ACT GTC CCC TTC TGG GCT CAC TAT GCT GCC GCC CAG TGG GAC TTT 527 
Leu Thr Val Pro Phe Trp Ala His Tyr Ala Ala Ala Gin Trp Asp Phe 
185 190 195 

GGA AAT ACA ATG TGT CAA CTC TTG ACA GGG CTC TAT TTT ATA GGC TTC 575 
Gly Asn Thr Met Cys Gin Leu Leu Thr Gly Leu Tyr Phe lie Gly Phe 
200 205 210 

TTC TCT GGA ATC TTC TTC ATC ATC CTC CTG ACA ATC GAT AGG TAC CTG 623 
Phe Ser Gly lie Phe Phe He He Leu Leu Thr lie Asp Arg Tyr Leu 
215 220 225 

GCT GTC GTC CAT GCT GTG TTT GCT TTA AAA GCC AGG ACG GTC ACC TTT 671 
Ala Val Val His Ala Val Phe Ala Leu Lys Ala Arg Thr Val Thr Phe 
230 235 240 245 

GGG GTG GTG ACA AGT GTG ATC ACT TGG GTG GTG GCT GTG TTT GCG TCT 719 
Gly Val Val Thr Ser Val He Thr Trp Val Val Ala Val Phe Ala Ser 
250 255 260 

CTC CCA GGA ATC ATC TTT ACC AGA TCT CAA AAA GAA GGT CTT CAT TAC 767 
Leu Pro Gly He He Phe Thr Arg Ser Gin Lys Glu Gly Leu His Tyr 
265 270 275 
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ACC TGC AGC TCT CAT TTT CCA TAC ATT AAA GAT AGT CAT CTT GGG GCT 815 
Thr Cys Ser Ser His Phe Pro Tyr lie Lys Asp Set His Leu Gly Ala 
280 285 290 

GGT CCT GCC GCT GCT TGT CAT GGT CAT CTG CTA CTC GGG AAT CCT AAA 863 
Gly Pro Ala Ala Ala Cys His Gly His Leu Leu Leu Gly Asn Pro Lys 
295 300 305 

AAC TCT GCT TCG GTG TCG AAA TGA GAAGAAGAGG CACAGGGCTG TGAGGCTTAT 917 
Asn Ser Ala Ser Val Ser Lys * 
310 315 

CTTCACCATC ATGATTGTTT ATTTTCTCTT CTGGGCTCCC TACAACATTG TCCTTCTCCT 977 

GAACACCTTC CAGGAATTCT TTGGCCTGAA TAATTGCAGT AG CTCTAACA GGTTGGACCA 1037 

AGCTATGCAG GTG AC AGAGA CTCTTGGGAT GACGCACTGC TGCATCAACC CCATCATCTA 1097 

TGCCTTTGTC GGGGAGAAGT TCAGAAACTA CCTCTTAGTC TTCTTCCAAA AGCACATTGC 1157 

CAAACGCTTC TGCAAATGCT GTTCTATTTT CCAGCAAGAG GCTCCCGAGC GAGCAAGCTC 1217 

AGTTTACACC CGATCCACTG GGGAGCAGGA AATATCTGTG GGCTTGTGAC ACGGACTCAA 1277 

GTGGGCTGGT GACCCAGTCA GAGTTGTGCA CATGGCTTAG TTTTCATACA CAGCCTGGGC 1337 

TGGGGGT 1344 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 216 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Met Asp Tyr Gin Val Ser Ser Pro lie Tyr Asp lie Asn Tyr Tyr Thr 
1 5 10 15 

Ser Glu Pro Cys Gin Lys lie Asn Val Lys Gin lie Ala Ala Arg Leu 
20 25 30 

Leu Pro Pro Leu Tyr Ser Leu Val Phe lie Phe Gly Phe Val Gly Asn 
35 40 45 

Met Leu Val He Leu He Leu He Asn Cys Lys Arg Leu Lys Ser Met 
50 55 60 

Thr Asp He Tyr Leu Leu Asn Leu Ala He Ser Asp Leu Phe Phe Leu 
65 70 75 80 
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Leu Thr Val Pro Phe Trp Ala His 
85 

Gly Asn Thr Met Cys Gin Leu Leu 
100 

Phe Ser Gly lie Phe Phe lie lie 
115 120 

Ala Val Val His Ala Val Phe Ala 
130 135 

Gly Val Val Thr Ser Val lie Thr 
145 150 

Leu Pro Gly lie He Phe Thr Arg 
165 

Thr Cys Ser Ser His Phe Pro Tyr 
180 

Gly Pro Ala Ala Ala Cys His Gly 

195 200 
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tyr Ala Ala Ala Gin Trp Asp Phe 
90 95 

Thr Gly Leu Tyr Phe He Gly Phe 
105 HO 

Leu Leu Thr He Asp Arg Tyr Leu 
125 

Leu Lys Ala Arg Thr Val Thr Phe 
140 

Trp Val Val Ala Val Phe Ala Ser 
155 160 

Ser Gin Lys Glu Gly Leu His Tyr 
170 175 

He Lys Asp Ser His Leu Gly Ala 
185 190 

His Leu Leu Leu Gly Asn Pro Lys 
205 



Asn Ser Ala Ser Val Ser Lys 
210 215 
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Claims 

1°) A purified CCR5 variant protein which 
comprises the first two transmembrane domains of wild 
type CCR5 but which lacks transmembrane domains 3-7. 
5 2°) The purified CCR5 protein variant of claim 

1, which is the CCR5 variant CCR5m303 having the amino 
acid sequence set forth in SEQ ID NO: 15; 

3°) The purified CCR5 variant protein of claim 

2, which is comprised together with heterologous amino 
10 acid sequence in a fusion protein. 

4°) A purified and isolated nucleic acid 
sequence encoding the protein of claim 1 or of claim 2 . 

5°) The nucleic acid of claim 4, which has the 
sequence set forth in SEQ ID NO: 12 where XXX is a stop 
15 codon or in SEQ ID NO: 14. 

6°) Isolated DNA primers for the amplification 
of the nucleic acid of claim 4, selected from the group 
consisting of nucleotide fragments, which comprise 
between 15 and 100 nucleotides of a sequence upstream or 
20 downstream nucleotide 540, with reference to SEQ ID NO: 
12. 

7°) Isolated DNA primers for the amplification 
of the nucleic acid of claim 4, selected from the group 
consisting of : 
25 SEQ ID NO:l and SEQ ID NO: 2 

SEQ ID NO: 3 and SEQ ID NO: 4 

SEQ ID NO: 5 and SEQ ID NO: 6 

SEQ ID NO: 7 and SEQ ID NO: 8 

8°) Isolated DNA probes for the detection of 
30 the nucleic acid of claim 4, selected from the group 
consisting of SEQ ID NO: 10 and SEQ ID NO: 11. 

9°) Method of detecting and identifying the 
presence of at least a CCR5 variant in a biological 
sample, comprising : 
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- bringing into contact the biological sample 
with a pair of primers according to claim 6, 

- amplifying the CCR5 variant, eventually 
present in said biological sample 

5 - demonstrating the amplification of the DNA 

fragment corresponding to the fragment flanked by the 
primers, for example by gel electrophoresis, 

- optionally verifying the sequence of the 
amplified fragment, for example by specific probe 

10 hybridization, sequencing or restriction site analysis. 

10° ) Method according to claim 9, wherein said 
pair of primers are selected from the group consisting: 
of: . 

SEQ ID NO:l and SEQ ID NO : 2 
15 SEQ ID NO: 3 and SEQ ID NO: 4 

SEQ ID NO: 5 and SEQ ID NO: 6 
SEQ ID NO: 7 and SEQ ID NO: 8, 

11°) Method according to claim 9, wherein said 
hybridization probes are selected in the group consisting 
20 of SEQ ID NO :10and SEQ ID NO : 11. 

12°) A kit ready for use for detecting at 
least a CCR5 variant in a biological sample, comprising :„ 

DNA primers as claimed in claim 6 or in claim 

7, 

25 reagents for extracting the DNA of said 

biological sample, 

reagents for amplifying DNA comprising 

polymerization enzymes, 

optionally, at least one oligonucleotide probe 
30 which specifically hybridizes to the amplified DNA, as 
claimed in claim 8. 

13°) A cell into which the nucleic acid of 
claim 4 or claim 5 has been introduced. 

14°) The nucleic acid of claim 4 or claim 5 as 
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contained in a vector. 

15°) A method of determining whether an indi- 
vidual is resistant to infection by a macrophage-tropic 
strain of human immunodeficiency virus type 1, comprising 
5 determining whether a CCR5 variant is present in a sample 
collected from the individual, wherein ' the CCR5 variant 
comprises the first two membrane domains of wild type 
. CCR5 but lacks transmembrane domains 3-7, wherein the 
presence of such a variant bears a positive correlation 
10 with a resistance to the infection. 

16°) The method of claim 15, wherein the CCR5 

is CCR5m303. 

17°) A method for determining the likelihood 
that an individual will be infected by a macrophage- 

15 tropic strain of human immunodeficiency virus type ; 1, 
comprising determining whether a CCR5 variant is present 
in a sample collected from the individual, wherein the 
CCR5 variant comprises the first two transmembrane 
domains of wild type CCR5 but lacks transmembrane domains 

20 3-7, wherein the presence of such a variant bears a posi- 
tive correlation with a lower probability of infection. 

18°) A method of evaluating the prognosis of 
an individual infected with a macrophage-tropic strain -of 
human immunodeficiency virus type 1, comprising 

25 determining whether a CCR5 variant is present in a sample 
collected from the individual, wherein the CCR5 variant 
comprises the first two transmembrane domains of wild 
type CCR5 but lacks transmembrane domains 3-7, wherein 
the presence of such a variant bears a positive correla- 

30 tion with a slower progression of the disease. 

19°) A method of determining whether an indi- 
vidual is resistant to infection by a macrophage-tropic 
strain of human immunodeficiency virus type 1, comprising 
characterizing both alleles of the individual, wherein a 
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determination that both CCR5 alleles of an individual 
encode a CCR5 variant which either (i) comprises the 
first two transmembrane domains of wild type CCR5 but 
which lacks transmembrane domains 3-7 or (ii) is func- 

5 tionally defective (e.g.) cannot be expressed at the cell 
surface) has a positive correlation with a resistance of 
the individual to infection. 

20°) The method of claims 15 to 19, wherein 
the presence of the CCR5 variant is determined by identi- 

10 fying a nucleic acid which encodes said variant. 

21°) The method of claims 15 to 19, wherein 
said identifying step is performed according to the 
method of claims 9 to 11. 

22°) The method of claim 19, wherein one CCR5 

15 allele is a CCR5m303 variant and wherein the other CCR5 
allele is a CCR5A32 variant. 

23 9 ) A method of determining whether an indi- 
vidual is less likely to be infected by a macrophage- 
tropic strain of human immunodeficiency virus type 1, 

20 comprising characterizing both CCR5 alleles of the indi- 
vidual, wherein a determination that one CCR5 allele is a 
wild type CCR5 allele and the other allele is a CCR5 
variant which comprises the first two transmembrane 
domains of wild type CCR5 but which lacks transmembrane 

25 domains 3-7 has a positive correlation with a , lower 
probability of infection relative to homozygous CCRSwild 
type individuals. 

24°) A method of evaluating the prognosis of 
an individual infected with a macrophage-tropic strain of 

30 human immunodeficiency virus type 1, comprising charac- 
terizing both CCR5 alleles of the individual, wherein a 
determination that one CCR5 allele is a wild type CCR5 
allele and the other is a CCR5 variant which comprises 
the first two transmembrane domains of the wild type CCR5 
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but which lacks transmembrane domains 3-7 has a positive 
correlation with a slower progression of disease. 

25°) Composition characterized in that it 
comprises a nucleic acid according to claim 4 or 5, for 
5 the preparation of a drug for inhibiting CCR5-mediated 

HIV infection. 

26°) Method of detecting a CCRS variant 
protein according to claims 1 to 3, wherein it comprises: 
. - contacting a biological sample with an 
10 antibody directed against said CCR5 variant protein, and 

- detecting the complex CCRS variant protein - 
antibodies by an appropriate means. 

27°) Antibodies, characterized in that they 
are directed against a CCR5 variant protein according to 
15 claims 1 to 3 . 
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